62 research outputs found

    Single-Cell RNA-Seq Technologies and Related Computational Data Analysis

    Get PDF
    Single-cell RNA sequencing (scRNA-seq) technologies allow the dissection of gene expression at single-cell resolution, which greatly revolutionizes transcriptomic studies. A number of scRNA-seq protocols have been developed, and these methods possess their unique features with distinct advantages and disadvantages. Due to technical limitations and biological factors, scRNA-seq data are noisier and more complex than bulk RNA-seq data. The high variability of scRNA-seq data raises computational challenges in data analysis. Although an increasing number of bioinformatics methods are proposed for analyzing and interpreting scRNA-seq data, novel algorithms are required to ensure the accuracy and reproducibility of results. In this review, we provide an overview of currently available single-cell isolation protocols and scRNA-seq technologies, and discuss the methods for diverse scRNA-seq data analyses including quality control, read mapping, gene expression quantification, batch effect correction, normalization, imputation, dimensionality reduction, feature selection, cell clustering, trajectory inference, differential expression calling, alternative splicing, allelic expression, and gene regulatory network reconstruction. Further, we outline the prospective development and applications of scRNA-seq technologies

    Cross-platform comparison of SYBRÂŽ Green real-time PCR with TaqMan PCR, microarrays and other gene expression measurement technologies evaluated in the MicroArray Quality Control (MAQC) study

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The MicroArray Quality Control (MAQC) project evaluated the inter- and intra-platform reproducibility of seven microarray platforms and three quantitative gene expression assays in profiling the expression of two commercially available Reference RNA samples (<b><it>Nat Biotechnol </it></b>24:1115-22, 2006). The tested microarrays were the platforms from Affymetrix, Agilent Technologies, Applied Biosystems, GE Healthcare, Illumina, Eppendorf and the National Cancer Institute, and quantitative gene expression assays included TaqMan<sup>® </sup>Gene Expression PCR Assay, Standardized (Sta) <it>RT</it>-PCR™ and QuantiGene<sup>®</sup>. The data showed great consistency in gene expression measurements across different microarray platforms, different technologies and test sites. However, SYBR<sup>® </sup>Green real-time PCR, another common technique utilized by half of all real-time PCR users for gene expression measurement, was not addressed in the MAQC study. In the present study, we compared the performance of SYBR Green PCR with TaqMan PCR, microarrays and other quantitative technologies using the same two Reference RNA samples as the MAQC project. We assessed SYBR Green real-time PCR using commercially available RT<sup>2 </sup>Profiler™ PCR Arrays from SuperArray, containing primer pairs that have been experimentally validated to ensure gene-specificity and high amplification efficiency.</p> <p>Results</p> <p>The SYBR Green PCR Arrays exhibit good reproducibility among different users, PCR instruments and test sites. In addition, the SYBR Green PCR Arrays have the highest concordance with TaqMan PCR, and a high level of concordance with other quantitative methods and microarrays that were evaluated in this study in terms of fold-change correlation and overlap of lists of differentially expressed genes.</p> <p>Conclusion</p> <p>These data demonstrate that SYBR Green real-time PCR delivers highly comparable results in gene expression measurement with TaqMan PCR and other high-density microarrays.</p

    Employing “FDAlabel” Database to Extract Pharmacogenomics Information from FDA Drug Labeling to Advance the Study of Precision Medicine

    Get PDF
    Pharmacogenomics (PGx) focuses on how genomics and genetic variants (inherited and acquired) affect drug response. A better understanding of the association between genetic markers and individual phenotypes may improve therapy by enhancing drug efficacy, safety, and advance precision medicine. The FDALabel database (https://rm2.scinet.fda.gov/druglabel/#simsearch-0) was developed from the FDA\u27s Structured Product Labeling (SPL) repository to allow users to perform full-text and customizable searches of the labeling section {e.g. Boxed Warning, Warning and Precautions, Adverse Reaction (AR) sections}. In this study, 48 known biomarkers were used to query PGx relevant contents from the FDALabel database, including Indication, Clinical Pharmacology, Clinical Studies, and Use in Specific Populations. As a result, we identified 162 drugs out of 1129 small molecule drugs with PGx biomarker information. Furthermore, statistical analysis, pattern recognition, and network visualization were applied to investigate association of drug efficacy and severe ARs with PGx biomarkers and subpopulation. The results indicated that these drugs have a higher association with certain ARs in specific patient subpopulations (e.g., a higher association between CYP2D6 poor metabolizers and ARs caused by drugs for the treatment of psychiatric disoders ), and cover a broad range of therapeutic classes (e.g., Psychiatry, Cardiology, Oncology, and Endocrinology). FDALabel database (free publicly available) provides a convenient tool to navigate and extract PGx information from FDA-approved drug. The knowledge gained from these drugs and biomarkers in this study will enhance the understanding of PGx to advance precision medicine

    Role of CD44 in clear cell renal cell carcinoma invasiveness after antiangiogenic treatment

    Get PDF
    Treballs Finals de Grau de Farmàcia, Facultat de Farmàcia, Universitat de Barcelona, 2017. Tutor/a: Joan Carles Rodríguez Rubio.[eng] During last century, big effort to understand the biochemical basis of cancer was carried out. One of the principal branches of these cancer investigations used drugs to prevent the formation of new blood vessels –process called angiogenesis– responsible for the nutrients supply of the tumour. These drugs are generally called antiangiogenics. It was discovered that some types of tumour have or develop resistance to these drugs when treatment was long enough. For that reason, mechanisms of resistance, aggressiveness, invasion and/or metastasis after the treatment are nowadays relevant to study. Recently, a protein that could be involved in the increased invasiveness of tumour cells after the antiangiogenic treatment appeared. This project collects some evidence that indicates that this protein, called CD44, might play a role in the increased invasion after antiangiogenic treatment in mouse models of renal carcinoma.[cat] Durant l’últim segle, s’ha fet un gran esforç per aprofundir en la basant bioquímica de la investigació contra el càncer. Una de les branques principals d’aquesta investigació utilitza fàrmacs que prevenen la formació de nous vasos sanguinis –procés anomenat angiogènesis- encarregats de nodrir el tumor. Aquests fàrmacs es diuen generalment antiangiogènics. S’ha descobert que alguns tipus de tumor tenen o desenvolupen resistència a aquests fàrmacs quan el tractament és prou llarg. Per aquesta raó, actualment s’està investigant profundament quins són els mecanismes pels quals apareix aquesta resistència, així com també perquè els tumors es tornen més agressius, invasius i/o metastàtics després del tractament. Recentment s’ha descobert una proteïna que podria estar involucrada en l’augment de la invasivitat de les cèl·lules tumorals després del tractament antiangiogènic. Aquest treball recull algunes de les evidències que apunten cap al paper de la proteïna CD44 en l’increment de la invasió tumoral post-tractament amb fàrmacs antiangiogènics en models ratolins de càncer renal

    Two new ArrayTrack libraries for personalized biomedical research

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Recent advances in high-throughput genotyping technology are paving the way for research in personalized medicine and nutrition. However, most of the genetic markers identified from association studies account for a small contribution to the total risk/benefit of the studied phenotypic trait. Testing whether the candidate genes identified by association studies are causal is critically important to the development of personalized medicine and nutrition. An efficient data mining strategy and a set of sophisticated tools are necessary to help better understand and utilize the findings from genetic association studies. </p> <p>Description</p> <p>SNP (single nucleotide polymorphism) and QTL (quantitative trait locus) libraries were constructed and incorporated into ArrayTrack, with user-friendly interfaces and powerful search features. Data from several public repositories were collected in the SNP and QTL libraries and connected to other domain libraries (genes, proteins, metabolites, and pathways) in ArrayTrack. Linking the data sets within ArrayTrack allows searching of SNP and QTL data as well as their relationships to other biological molecules. The SNP library includes approximately 15 million human SNPs and their annotations, while the QTL library contains publically available QTLs identified in mouse, rat, and human. The QTL library was developed for finding the overlap between the map position of a candidate or metabolic gene and QTLs from these species. Two use cases were included to demonstrate the utility of these tools. The SNP and QTL libraries are freely available to the public through ArrayTrack at <url>http://www.fda.gov/ArrayTrack</url>. </p> <p>Conclusions</p> <p>These libraries developed in ArrayTrack contain comprehensive information on SNPs and QTLs and are further cross-linked to other libraries. Connecting domain specific knowledge is a cornerstone of systems biology strategies and allows for a better understanding of the genetic and biological context of the findings from genetic association studies. </p

    Assessing Reproducibility of Inherited Variants Detected With Short-Read Whole Genome Sequencing

    Get PDF
    Background: Reproducible detection of inherited variants with whole genome sequencing (WGS) is vital for the implementation of precision medicine and is a complicated process in which each step affects variant call quality. Systematically assessing reproducibility of inherited variants with WGS and impact of each step in the process is needed for understanding and improving quality of inherited variants from WGS. Results: To dissect the impact of factors involved in detection of inherited variants with WGS, we sequence triplicates of eight DNA samples representing two populations on three short-read sequencing platforms using three library kits in six labs and call variants with 56 combinations of aligners and callers. We find that bioinformatics pipelines (callers and aligners) have a larger impact on variant reproducibility than WGS platform or library preparation. Single-nucleotide variants (SNVs), particularly outside difficult-to-map regions, are more reproducible than small insertions and deletions (indels), which are least reproducible when \u3e 5 bp. Increasing sequencing coverage improves indel reproducibility but has limited impact on SNVs above 30×. Conclusions: Our findings highlight sources of variability in variant detection and the need for improvement of bioinformatics pipelines in the era of precision medicine with WGS

    Assessing reproducibility of inherited variants detected with short-read whole genome sequencing

    Get PDF
    Background: Reproducible detection of inherited variants with whole genome sequencing (WGS) is vital for the implementation of precision medicine and is a complicated process in which each step affects variant call quality. Systematically assessing reproducibility of inherited variants with WGS and impact of each step in the process is needed for understanding and improving quality of inherited variants from WGS. Results: To dissect the impact of factors involved in detection of inherited variants with WGS, we sequence triplicates of eight DNA samples representing two populations on three short-read sequencing platforms using three library kits in six labs and call variants with 56 combinations of aligners and callers. We find that bioinformatics pipelines (callers and aligners) have a larger impact on variant reproducibility than WGS platform or library preparation. Single-nucleotide variants (SNVs), particularly outside difficult-to-map regions, are more reproducible than small insertions and deletions (indels), which are least reproducible when > 5 bp. Increasing sequencing coverage improves indel reproducibility but has limited impact on SNVs above 30x. Conclusions: Our findings highlight sources of variability in variant detection and the need for improvement of bioinformatics pipelines in the era of precision medicine with WGS.Peer reviewe
    • …
    corecore